Indicators Ratings Parallel =  7 ms
Indicators Ratings =  2 ms
CPU compute ratings: 1 iteration avg time: 570 ms
 
 ---- 
cuda_malloc<d_ratings> 0.65
cuda_malloc<d_companies_map> 0.45
cuda_malloc<d_indicatorsRatings> 1.94
kernel<computeRatingsKernel> 0.63
kernelSegment<sort> 1.54
toHostMemory<d_ratings> 0.39
toHostMemory<d_companies_map> 0.35
toHostMemory<d_indicatorsRatings> 7.87
ranking 0.57
TotalRankingTime 14.71

cuda_malloc<d_ratings> 0.28
cuda_malloc<d_companies_map> 0.07
cuda_malloc<d_indicatorsRatings> 0.07
kernel<computeRatingsKernel> 0.6
kernelSegment<sort> 0.51
toHostMemory<d_ratings> 0.35
toHostMemory<d_companies_map> 0.39
toHostMemory<d_indicatorsRatings> 8.33
ranking 0.56
TotalRankingTime 11.5

cuda_malloc<d_ratings> 0.24
cuda_malloc<d_companies_map> 0.08
cuda_malloc<d_indicatorsRatings> 0.07
kernel<computeRatingsKernel> 0.58
kernelSegment<sort> 0.56
toHostMemory<d_ratings> 0.36
toHostMemory<d_companies_map> 0.59
toHostMemory<d_indicatorsRatings> 8.67
ranking 0.74
TotalRankingTime 12.16

GRID K520 :  797.000 MHz   (Ordinal 0)
8 SMs enabled. Compute Capability sm_30
FreeMem:   3785MB   TotalMem:   4096MB   64-bit pointers.
Mem Clock: 2500.000 MHz x 256 bits   (160.0 GB/s)
ECC Disabled


